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games 


R.K. Gupta*® and D. Khan 


Abstract 


When we design the payoff matrix of a game on the basis of the available 
information, then rarely the information is free from impreciseness, and as 
a result, the payoffs of the payoff matrix have a certain amount of ambi- 
guity associated with them. In this work, we have developed a heuristic 
technique to solve two persons m x n zero-sum games (m > 2,n > 2), with 
interval-valued payoffs and interval-valued objectives. Thus the game has 
been formulated by representing the impreciseness of the payoffs with in- 
terval numbers. To solve the game, a real coded genetic algorithm with 
interval fitness function, tournament selection, uniform crossover, and uni- 
form mutation has been developed. Finally, our proposed technique has 
been demonstrated with a few examples and sensitivity analyses with re- 
spect to the genetic algorithm parameters have been done graphically to 
study the stability of our algorithm. 


AMS subject classifications (2020): 45D05; 42C10; 65G99. 


Keywords: Two persons zero sum game; Interval-valued payoffs; Genetic 
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1 Introduction 


In this paper, an effort has been made to solve two persons m x n zero-sum 
games (m < 2,n < 2), with interval-valued payoffs and interval-valued ob- 
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jectives. A payoff matrix with interval-valued payoffs has been considered. 
Normally, when we design the payoff matrix of a game on the basis of the 
available information, then rarely the information is free from impreciseness/ 
vagueness, and as a result, the payoffs of the payoff matrix have a certain 
amount of ambiguity associated with them [8]. Due to this reason, the payoff 
received by a player as the result of interaction between any of his strate- 
gies with any strategy of his competitor is assumed to be interval-valued. 
Out of the several types of researches done on games, a large fraction has 
been based on two-person games like Blackwell’s work considering vector pay- 
offs [4], Zeleny’s investigation on games with multiple payoffs, and so on [32]. 
Nishizaki and Sakawa [25] worked on two persons zero sum games with multi- 
ple goals. Since the end of the last century, especially in the last two decades, 
researchers have given more emphasis on multiobjective games, considering 
impreciseness in payoffs and goals. In the majority of these researches, the 
impreciseness has been represented and dealt with the help of the fuzzy ap- 
proach. Researchers like Aubin [1, 2], Butnariu [5, 6], Campos [7], Nishizaki 
and Sakawa [26], Bector and Chandra [3], Vijay et al. [31], Cunlin and Qiang 
[11], Dutta and Gupta [12], Gong and Hai [15], Chandra and Aggarwal [10], 
Li [18, 19], Roy and Mondal [28], Jiang et al. [17], Qiu et al. [27], Madandar 
et al. [21] have all significantly contributed to the researches on the fuzzy 
games. 


However, in the case of fuzzy approach, the user arbitrarily defines the 
shapes of fuzzy numbers. It is assumed that the constraints, objectives, and 
parameters are fuzzy sets and that the membership functions are known to 
the user/decision-maker. However, sometimes the user or the decision-maker 
is unable to specify the membership function accurately and hence, has to 
resort to an arbitrary approach. On the other hand, while dealing with impre- 
ciseness using interval numbers, one can be absolutely sure that the interval 
results will always contain the exact result irrespective of whether its upper 
boundary and lower boundary are overestimated. No scope for subjectivity is 
present in this case. Thus in a way, the interval approach has some distinct 
advantages over the fuzzy approach. Because of this, in this paper, we have 
dealt with the impreciseness with the help of the interval approach; that is, 
the imprecise payoffs have been represented by interval numbers. The ob- 
jective function of the game is also interval-valued. In this paper, a genetic 
algorithm (GA) based heuristic technique of solving two persons zero sum 
games with interval-valued payoffs and objectives has been proposed, that is, 
to solve the game a real coded GA with interval fitness function, tournament 
selection, uniform crossover, and uniform mutation has been developed. GA 
is a widely popular heuristic search and optimization technique, first devel- 
oped by Prof. J. H. Holland, University of Michigan. Currently, there are 
several textbooks on GAs by authors like Goldberg [14], Michalewicz [24], 
Sakawa [29], Gen and Cheng [13] and others. While implementing GA to 
solve the game, for selection operation and also for finding the best chromo- 
some in each generation, we have used the concepts of interval arithmetic 
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and order relation of interval number. The contributions of researchers like 
Ishibuchi and Tanaka [16], Chanas and Kuchta [9], Sengupta and Pal [30], 
Mahato and Bhunia [22] in defining ordered relations of interval-valued num- 
bers are worth mentioning. 


2 Concepts of interval numbers and their comparisons 


Let K = (kr, kr] = {@: kp < a < ke, kr, kr,x € R} be an interval-valued 
number, where ky and kp are the left and right boundaries of K, respectively. 
Also, assume that K = (kC, kr) = {a :kokp <a <ko +ky,,x € R}, where 
ko = (kr + kr)/2 and k, = (kr — ky)/2, are perspectively the center and 
radius of the interval, and R is the set of real numbers. If there are two 
closed interval numbers K and G(= [gz,gr]) and A € (+,—,0,1) is a binary 
operation on the set of real numbers, then the binary operation on interval 
K and G is defined by KAG = {kAg: k € K and g € G}. In the case of 
division, it is assumed that 0 ¢ G. Thus, 


(i) A+G= [kr + 91,kr + gr, 
(ii) K-—G=|k, — gr, kr - gz), 


[Akz, Akr] if \ > 0, 


[Akp, Akz] if A <0 where ) is a real number. 
R» LI) 1 4 


(iii) AK = 
(iv) KxG= (kr, kr] x [97 9R| 


[Min(krgr,kxrgr. krgt,krgr), Mar(kygt, krgr, kr, gt, krgr)); 
[Iki gx, krgr|, only if kp > 0 and gL = 0. 


Considering the optimistic decision making for maximization problems, the 
order relation > 9 max between the intervals K and G is defined as 
(v) K >omax G if and only if ky > gr, 


(vi) K > omaxG if and only if K >omax Gand K £#G. 
Considering the pessimistic decision-making for minimization problems, 
the order relation >p max between the intervals K = [ky, kr] = (kc, kr) 
and G = [gr, gr] = (9c, gr) may be defined as 


(vii) K >pmax G if and only if kc > gc, when intervals K and G are either 
disjoint or partially overlapping. 


(viii) K >pmax G if and only if kc > gc and k, < g, when interval K is 
contained in interval G. 
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3 Methodology and formulation 


A popular and simple case of game theory is two persons zero sum game 
in which it is assumed that two players are involved, each having a finite 
number of strategies, and the algebraic sum of the gains and losses of those 
two players is equal to zero, that is, due to the interaction of any pair of 
strategies of the competitive players (say A and B), the amount received by 
one player is exactly equal to the losses of the other player. This can be 
illustrated with the following the m x n matrix: 


Qm1*** Amn 


where M € R™*” is an m X n real payoff matrix of player A. Here RY is the 
U-dimensional Euclidean space and a,; is the payoff of player A, when player 
A plays the strategy 7 and player B plays the strategy 7. 

A mix strategy of player A is given by the vector « in R', that is, nonnegative 
orthant of R™, such that 


1 ry 
t be 
Lem = 1 where em = |. and xr©=]| , 20 
1 mx1 Tm/ m1 


Thus if S”” is the strategy space of player A, then S” = {# € R'?, x em = 1}. 
Similarly, the strategy space of player B is given by S” = {VE R, Yt en = 
ik . 7 

Here, the vector y denotes a mix strategy of player B such that y’ e, = 1. 
If player A plays the mix strategy r and B plays the mix strategy y, then 
the expected payoff of player A is given by a*My = 7", 04 hte ay 
The player A’s mix strategy oF is said to be his optimal strategy if x M y# < 
at M y* for all rE S™, where y* is B optimal strategy. 


Similarly, the player B’s strategy y* is said to be optimal strategy if 
co My>a# My* forall yes”. 


Thus player A’s objective is to determine the optimum values of x;’s in such 
a manner that it can maxi-min its expected payoff for any values of the 
elements of the vector y that player B chooses. 

Hence from player A’s point of view, the game can be expressed by the 
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following problem: 


Maz Z(x2) = Min Ajj X4 1 
(2) = Min’) ay (1 
subject to x em, =1 
z*>0 
From B’s side, we have 
Min U(¥) = Maa 2 Qig Yj (2) 
j= 


subject to ye, =1. 

However, as discussed in the introduction section in real-life situation, the 
payoffs are generally imprecise, and here we have represented them by 
interval-valued numbers, that is, a;; = [aij,, dijzl- 

Thus 


m 


m m m 
s Aijti = s [Gj , 1 ij _ Pi = S Qj, Vi, 5 ijati| - 
i=1 i=l i=1 


i=l 


Hence (1) becomes 


Pe m m 
Max Z(x) = a 5 Qij_ vi, 5 Qijatil > 
~ i=1 i=1 


subject to ze, = 1 


z'>0. 


From B’s point of view, the problem is 
m m m 
Min U(Y) = Max » Qijr Yj; 2, Qijptj 
j= j= 


subject to y’e, = 1 


aa 
IV 
20 


4 Solution procedure 


We have developed a real-coded GA for solving the interval-valued game. 
The main algorithm and the broad working principle of GA are widely pop- 
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ular and are available in several books and journal papers. Hence, we do not 
present it here anymore. However, the finer details of its basic components 
like “representation of chromosome”, “initialization of population”, “evalu- 
ation function”, “Selection process”, and “Genetic operators (crossover and 
mutation)” are explained below in brief. Furthermore, here we explain the 
GA developed for solving the problem from player A’s side. For solving the 
problem from player B’s side, an almost similar approach is adopted. 

In the problem formulated by us, from player A’s side, there are “m” (> 2) 
continuous decision variables (each representing the probabilities with which 
each of the “m” strategies is played by player A). Hence the ith chromosome 


is represented by a real row matrix X;(A) = [Xi1, Xi2,...,Xim,], where 
Xji1,Xj2,---,Xim (such that ye Xi =1&0 < Xi; < 1 for all 7 = 
1,2,...m,) represent the decision variables, 71, 272,...,U%m, respectively, of 


the problem. 

After representing the chromosomes, the population size (popsize) numbers 
of chromosomes have been initialized. 

In this problem, for each chromosome, the first component (gene), that is, 
Xj, is initialized by randomly generating a real number between 0 and 1. 
Then, Xj is initialized by randomly generating a real number between 0 
and (1 — Xj). Similarly, X;3 is initialized by randomly generating a real 
number between 0 and 1 — (Xj1 + Xi2), and so on, until the value of the 
last component is taken as Xjm, = {1 — Sars X;;}. For each component 
of each chromosome, random numbers have been selected by using uniform 
distribution. 


In the evaluation function, the fitness value for each chromosome (i.e., 
a potential solution) is calculated. In our work, the fitness value for each 
chromosome is considered to be the value of the objective function corre- 
sponding to it. Using the selection operation, the below-average solutions 
are eliminated from the population for the next generation. In this work, 
the tournament selection scheme of size two with replacement has been im- 
plemented. In this selection scheme, at first, two chromosomes are randomly 
selected. Then out of these two chromosomes, the better chromosome (i.e., 
one having the better fitness value) is finally selected for the next generation. 
The selection of a better chromosome is made on the basis of definitions (vi?) 
and (viii) (discussed under section 2) of order relations between two interval 
numbers as the objective function of optimization problem be interval valued. 
Once the selection process is complete, the surviving chromosomes become 
eligible to take part in crossover operation, in which at the time two parent 
chromosomes get involved to generate offspring. These offspring possess the 
features of both the parent chromosomes. Here we have denoted the prob- 
ability of crossover by pobcross. In this work, the crossover operation (as 
shown in Figures 1 and 2) is done in the following manner: 
At first, pobcross* popsizeis is found and its integral value is stored in variable 
V. 
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(i) Then two chromosomes X;(A) and X,(A) are randomly selected from 
the population for crossover. For creating the rth (r = 1,2,...,m) 


component X. : and x of the two offspring, the following procedure 
is adopted. 


(ii) A real number “h,.” is randomly generated between 0 and |X;, — Xqrl. 
(iii) If Xip > Xqr, then X/ = Xip — hy and XJ, = Xqp + hy. 
Otherwise, if Xj, < Xqr, then X/ = Xi, + hp and Xf, = Xqp — hp. 


The above steps [(i) —(iii)] are repeated for V/2 times. Let the parent chromo- 
somes be X;(A) and X,(A), as shown below in Figure 1: Now if it is assumed 


xia — 


(Xinthm) 


——> X41 = X'q2= X gm= 
(Xg-hy) | (Xe thy) _ |... ae (Xon-hm) 


Figure 1: The parent chromosomes 


XA) 


that Xi, < Xqi, Xig > Xq2, and Xim < Xqm, then the child chromosomes 
x! (A) and x (A) will be given by (as shown below in Figure 2): By apply- 


BO pki [Xo | a | on 


X; (A) 
: Xql Xq2 Xq3 iets Xagm 


Figure 2: The child chromosomes 


ing the mutation operation to a single chromosome, random variations have 
been injected into the population. The objective of mutation is to push the 
population slightly towards a better path. Here uniform mutation has been 
used and the probability of mutation has been denoted by pobmute. 

If the elements (genes) X;,, and X;, of chromosome Xj; are selected for mu- 
tation, then |Xiw — Xiz| = b(say) is first calculated. Then the modified value 
of Xiw and X,, are given by 


Pare Xiw + A(b), if Xj, > Xiws snd a Xit + A(b), if Xjy > Xit; 
vy Xiw = A(b), if X it < Xiw; ut X; = A(b), if Xiw < Xits 
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where w € {1,2,...,m}, t © {1,2,...,m}, and A(b) = a is a random real 
number between (0, d]. 


5 Numerical example 


We have considered three examples and solved them with the proposed GA. 
Except for Example 3, in the other two examples, the payoffs are considered 
as interval-valued. However, in Example 3, they are considered as fixed (by 
taking identical values for the left boundaries and right boundaries of the 
intervals).The values of the parameters considered in these examples are all 
feasible, although they have not been selected from any case study. In each of 
the examples, 20 independent runs have been performed by the proposed GA, 
of which the best value of the game has been taken and the corresponding 
mix strategies of both the players are determined and displayed in Tables 1-6. 
The following values of GA parameters are used in this work: popsize = 100, 
pobcross = 0.95, pobmute = 0.15, maxgen = 1000. 


Example 1. 
[2, 7,3, 3] (0,8) = [—3,-]] 
M, = [—5, -1] [0, 0] (0, 2] 
[—1.5, —0.5] [—6, —2] [0.25, 3.75] 


Mixed Strategy of player A i.e.x 


Value of the game in interval 


form [Obj L, Obj R] 


Expected value of the game, i.e., 


center value of [Obj L, Obj R] 


0.403846 
x = | 0.230769 


0.365385 


[—1.120192, 1.427885] 
[—2.192308, 2.500000] 
[—0.611539, 0.919231] 


0.153846 


Table 1: Solution for player A 


Mixed strategy of player B, i.e., ¥ 


Value of the game in interval 


form [Obj L, Obj R] 


Expected value of the game, i.e., 


center value of [Obj L, Obj R] 


0.153846 [—1.1738462, 1.430769] 
y = | 0.230769 [—1.769231, 1.461538] —0.153846 
- 0.615385 [—0.076923, 0.769231] 
Table 2: Solution for player B 
Example 2. 
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Mixed strategy of player A, i.e., x | Value of the game in interval | Expected value of the game, i.e., 
‘orm [Obj L, Obj R] center value of [Obj L, Obj R] 
0.225490 2.725490, 6.725490] 
x = | 0.598039 3.901961, 5.549020] 4.725490 
~ [0.176471 
Table 3: Solution for player A 
Mixed strategy of player B, i.e., y | Value of the game in interval | Expected value of the game, i.e., 
orm [Obj L, Obj R] center value of [Obj L, Obj R] 
0.607843 —6.450980, —3.000000] 
y = 10.039216 —6.294117, —3.156863] —4.725490 
~ [0.352941 
Table 4: Solution for player B 
Example 3. 


Mixed strategy of player A, i.e., x 


[Obj L, Obj R] 


Value of the game in interval from 


Expected value of the game, i.e., 


center value of [Obj L, Obj R] 


x = | 0.290323 
- 0.161290 


vane 


(5.032258, 5.032258] 


5.032258 


Table 5: Solution for player A 


Mixed strategy of player B, i.e., ¥ 


Value of the game in interval from 


Expected value of the game, i.e., 


0.000000 


—_h 


[Obj L, Obj R] center value of [Obj L, Obj R] 
0.354839 
5 
y= ae [—5.032258, —5.032258] —5.032258 


Table 6: Solution for player B 


6 Sensitivity analy 


sis 


The outcome of this work is heavily dependent on the stability, convergence, 
and efficiency of the algorithm proposed by us. Hence, to study this stability, 
by considering Example 1, the sensitivity of the “expected value of the game” 
(from player A’s point of view) have been analyzed graphically with respect 
to GA parameters like pobcross, pobmute, maxgen and pobsize separately, 
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keeping the other parameters at their original values. Figures 3-6 reveal that 
the result of Example 1, obtained by the GA proposed by us, is stable over 
a large range of the GA parameters mentioned above. 


when pobcross = 0.95, pobmute= 0.15, 
maxgen = 1000 
0.156 


r 

cee 
pb 
uw 
Bb 


Expected value of the game 
la 
o 
i 
uv 
N 


popsize 


Figure 3: Expected value of the game for player A vs popsize 


when popsize = 100, pobcross = 0.95, 
pobmute = 0.15 


0.156 


0.154 


Expected value of the game 
oO 
i 
P~ 7 
» 


0.142 
250 450 650 850 1050 1250 


maxgen 


Figure 4: Expected value of the game for player A vs maxgen 


Figure 3 shows that when pobcrossis fixed at 0.95, pobmuteat 0.15 and 
mazgenat 1000, then the expected value of the game is fairly stable when 
the popsizeis equal to or above 43. Similarly, it is evident from Figure 4 
that when pobcross, pobmute and pobsize are kept at their original values, 
then once the value of maxgen goes above 452 mark. Moreover, the expected 
value of the game becomes perfectly horizontal and thus confirms that it is 
perfectly stable. Similarly, from Figures 5 and 6, it is clear that the expected 
value of the game is stable when the value of pobcrossis greater or equal to 
0.72 and the value of pobmuteis greater or equal to 0.1. 
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when popsize= 0.95, pobmute= 0.15, 
maxgen = 1000 

@ 0.15385 
E 0.153845 
® 0.15384 
= 

+ 0.153835 
a 

@ 0.15383 


a 
q 0-153825 


> 
= 0.15382 


$ 0.153815 
@ 0.15381 
a 05 06 O07 O8 089 


pobcross 


Figure 5: Expected value of the game for player A vs pobcross 


when popsize= 0.95, pobcross= 0.95, 
maxgen = 1000 


0.154 
0.1538 
0.1536 
0.1534 
0.1532 
0.153 
0.1528 
0.1526 
0 0.2 0.4 0.6 
pobmute 


Expected value of the game 
for playerA 


Figure 6: Expected value of the game for player A vs pobmute 


7 Conclusion 


In this paper, a heuristic technique has been developed to solve two per- 
sons m X n zero sum games (m > 2,n > 2), with interval-valued payoffs 
and interval-valued objectives. It is a well-known fact that during the design 
phase of the payoff matrix of a game rarely the available information is free 
from impreciseness. Because of this impreciseness, the payoffs of the pay- 
off matrix have a certain degree of ambiguity present in them. To address 
this impreciseness-driven ambiguity in payoffs and goals, since the end of the 
last century, several types of research have been done in two persons zero 
sum games with the help of fuzzy approach. However, as discussed in the 
“Introduction” section of this paper, the interval approach has some distinct 
advantages over the fuzzy approach. While dealing with impreciseness us- 
ing the interval approach, one can be absolutely sure that the interval result 
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will always contain the exact result irrespective of whether its upper bound- 
ary and lower boundary are overestimated. Hence, in this work, we have 
addressed the said impreciseness with the help of interval-valued numbers 
and thus formulated the game with interval-valued payoffs and objectives. 
To solve the game, a real coded GA with interval fitness function, tourna- 
ment selection, uniform crossover, and uniform mutation has been developed. 
Finally, a few numerical examples have been solved with the help of the de- 
veloped GA and to study the stability of the GA, sensitivity analyses with 
respect to different GA parameters have been done and shown graphically. 
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